Corpus: hau_wikipedia_2021_10K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              97            98             99            99            99
1000            754           948            981           984           985
10000          4122          7825           9244          9665          9829
100000         4123          7826           9245          9666          9830
1000000        4123          7826           9245          9666          9830
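
The counts in the table can be approximated with a short sketch. It is not the tool that produced this page. It assumes the figures are numbers of distinct N-grams (each value stays at or below K, and the values rise with N), that tokens are split on whitespace, and that sentences shorter than N words contribute nothing for that N. The function name count_ending_ngrams is hypothetical.

```python
from collections.abc import Iterable


def count_ending_ngrams(
    sentences: Iterable[str], k: int, max_n: int = 5
) -> dict[int, int]:
    """Count distinct word n-grams (n = 1..max_n) ending the first k sentences."""
    # One set of distinct trailing n-grams per n.
    seen: dict[int, set[tuple[str, ...]]] = {n: set() for n in range(1, max_n + 1)}
    for i, sentence in enumerate(sentences):
        if i >= k:
            break
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            # Assumption: sentences with fewer than n tokens are skipped for that n.
            if len(tokens) >= n:
                seen[n].add(tuple(tokens[-n:]))
    return {n: len(grams) for n, grams in seen.items()}
```

Called once for each K in the table, this returns one row of counts. If K exceeds the number of sentences available, the loop simply runs out of input and the counts stop growing.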


Zipf's diagram for sentence endings


Gnuplot diagram
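
As a rough illustration of the data behind such a diagram, the sketch below computes rank-frequency pairs for sentence-final words. It assumes the diagram plots frequency against rank, as Zipf plots usually do, and that endings are single whitespace-separated words. Neither assumption is confirmed by this page, and the function name ending_rank_frequency is hypothetical.

```python
from collections import Counter


def ending_rank_frequency(sentences: list[str]) -> list[tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final words, most frequent first."""
    # Assumption: the last whitespace-separated token is the sentence ending.
    counts = Counter(s.split()[-1] for s in sentences if s.split())
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))
```

The pairs can be written out one per line and plotted on logarithmic axes in any plotting tool.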

1325 msec needed at 2021-07-11 07:01